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ABSTRACT 



A switching device for forwarding network traffic to a 
desired destination on a network, such as a telephone or 
computer network. The switching device includes multiple 
ports and uses a lookup table to determine which port to 
forward network traffic over. The lookup table includes 
network addresses that are maintained in ascending or 
descending order. The switching device includes multiple 
binary search engines coupled in series including one or 
more precursor binary search engines and a final stage 
binary search engine. Together, the binary search engines 
perform an N iteration binary search. Additionally, a single 
search engine can perform multiple concurrent searches so 
that source and destination addresses can be obtained simul- 
taneously and without wasted memory cycles. 
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NETWORK SWITCHING DEVICE WITH 
PIPELINED SEARCH ENGINES 

CROSS-REFERENCE TO RELATED 
APPLICATION 

[0001] This application is a continuation of U.S. patent 
application Ser. No. 09/166,603, filed on Oct. 5, 1998, 
entitled "Network Switching Device With Pipelined Search 
Engines, which is based on provisional U.S. patent appli- 
cation Serial No. 60/072,280 filed Jan. 23, 1998, and entitled 
"Forwarding Database Lookup Technique." 

FIELD OF THE INVENTION 

[0002] This invention relates generally to networks, such 
as telephone and computer networks, and, more particularly, 
relates to routing information through such networks. 

BACKGROUND OF THE INVENTION 

[0003] A network allows two or more parties to commu- 
nicate with each other. In their simplest form, networks 
generally include transmission lines and switching devices 
(e.g., routers, switches, switching routers, etc.). The trans- 
mission lines carry signals (e.g., electrical, optical, etc.), 
while the switching devices are intermediate stations that 
establish temporary connections between transmission lines. 
In telephone networks, for example, a caller's line goes to a 
switching device where the actual connection is made to the 
called party. In computer networks, devices such as routers 
receive messages on the network and forward the messages 
to their correct destinations. Computer networks can be as 
small as a local area network (LAN) consisting of a few 
computers, printers, and other devices, or it can consist of 
many computers distributed over a vast geographical area 
(e.g., the Internet). 

[0004] An example computer network 10 is shown in 
FIG. 1A. The network includes two local segments 12 and 
14, and connection to a remote network 16. Nodes, labeled 
as A- J, represent computers connected to the local segments. 
A switching device 20 includes three ports 22-24 and 
switches network traffic between segments 12, 14, and the 
remote network 16, Network 16 may also include switching 
devices, such as switching device 21, which then connects 
other segments (not shown) to the network Switching 
device 20 allows the nodes on one segment to communicate 
with nodes on other segments and to other switching 
devices. The nodes communicate with each other through a 
protocol (e.g., HTTP, TCP/IP, SMB, etc.) which allows the 
nodes to transmit and receive network frames (a network 
frame includes a destination address, a source address, and 
a data field). When switching device 20 receives a frame 
from a node, it analyzes the destination address by searching 
a lookup table 26, shown in FIG. IB. Lookup table 26 
includes table entries having a network address field and a 
port field. When the destination address is matched to a 
network address in the lookup table, switching device 20 
determines which port to forward the frame to by obtaining 
the port number corresponding to the matched network 
address. For example, if node A on segment 12 sends a 
message to node H on segment 14, switching device 20 
receives the message from node A and in response searches 
the entries in the network address field of lookup table 26. 
Table entry 28 contains the network address for H. A 



corresponding port field 30 for network address H indicates 
that the frame should be forwarded over port 2. Additional 
background information on switches can be found in a 
number of references, such as Fast Ethernet (1997) by L. 
Quinn et al., Computer Networks (3 rd Ed. 1996 by A 
Tannenbaum, and High-Speed Networking with LAN 
Switches (1997) by G. Held, all of which are incorporated 
herein by reference. 

[0005] The switching device can obtain the network 
addresses for the lookup table in different ways, depending 
on the particular implementation of the switching device. 
For example, the switching device may snoop network 
traJEc so that when a frame is received on a port, the 
switching device determines if the frame's source address is 
in the table and, if it is not, adds an entry containing the 
source address and the inbound port to the table. Thus, the 
switching device is said to "learn" addresses and port 
numbers from any frame that is transmitted by a node. 
Another technique some switching devices (e.g., routers) 
use to obtain the lookup table is from other switching 
devices through a special protocol. Thus, routers supply 
network addresses to each other to supplement their lookup 
tables. 

[0006] Consequently, when a network frame is received in 
a switch, both the source and destination addresses must be 
searched in the lookup table-the source address for "learn- 
ing" and the destination address for forwarding. To search 
the lookup table, a single search engine (not shown) within 
the switch 20 sequentially accesses lookup table entries and 
compares the entries to the destination address in the net- 
work frame. After the search for the destination address is 
completed, a second independent search is performed for the 
source address. An example timing diagram 40 for the 
search engine is shown in FIG. 2. During a first clock cycle 
42, the search engine accesses the lookup table and obtains 
a network address. During a second clock cycle 44, the 
search engine compares a network address obtained from the 
lookup table to the destination address. The first and second 
clock cycles together form a single iteration of the search. If 
there is no match, the search engine loads an address for the 
next lookup table entry to analyze. The process is repeated 
during clock cycles 46 and 48 (a second iteration), and so on 
until a match is found or the search fails. After the search for 
the destination address is completed, a second search is 
performed for the source address. Unfortunately, given 
current memory and ASIC speed limitations, the above- 
described technique for searching the lookup table is too 
slow to meet the requirements of recently-developed gigabit 
switches. 

[0007] Additionally, with sequential searching, the lookup 
table is only accessed every other clock cycle, wasting 
valuable time. 

[0008] An objective of the present invention, therefore, is 
to provide a high-speed network switching device that can 
quickly and efficiently search through address lookup tables 
and that overcomes the limitations of the prior art. SUM- 
MARY OF INVENTION 

[0009] The present invention provides a switching device 
(e.g., router, switch, switching router, telephone switch, etc.) 
that forwards network traffic to a desired destination on a 
network, such as a telephone or computer network. The 
switching device includes multiple ports and uses a lookup 
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table to determine which port to forward network traffic 
over. The network traffic is typically in the form of network 
frames that include source and destination addresses. 

[0010] In one aspect of the invention, the lookup table 
includes network addresses that are maintained in sorted 
order (e.g., ascending or descending order) to facilitate 
binary searches. Additionally, multiple binary search 
engines are connected in series and perform multiple binary 
searches simultaneously. Rather than having each binary 
search engine perform an independent search, the binary 
search engines each perform a predetermined number of 
iterations of an N iteration search. Additionally, each search 
engine has a separate memory, which stores only data (nodes 
from the lookup table) necessary for its respective iterations 
of the search. When a search engine completes its iterations 
of the search, the results are passed to the next search engine 
in the series. The next search engine uses the results from the 
previous search engine as a starting point to performing its 
respective iterations of the binary search. The number of 
search engines connected in series (also called pipelining) 
can vary between two and N, where N is the number of 
iterations needed for a binary search to complete. If N search 
engines are used, each search engine performs only one 
iteration. 

[0011] By pipelining search engines, increased throughput 
is achieved. Additionally, by storing only the data necessary 
for a predetermined number of iterations, very little memory 
space is needed. For example, a lookup table with 64K 
entries requires 16 iterations (2 16 -64K) to search the entire 
table using a binary search. If two search engines are used, 
each search engine performs 8 iterations each. The first 
search engine's memory only stores 256 entries (28), while 
the second search engine stores either all 64K entries or the 
remainder of the entries after extraction of the 256 entries 
from the lookup table. 

[0012] In another aspect of the invention, each search 
engine may perform concurrent source and destination 
searches of the lookup table. That is, during one clock cycle, 
the search engine performs part of a search for the destina- 
tion address of the network frame and during the next clock 
cycle the search engine performs part of a search for a source 
address. Thus, one search engine performs two independent 
searches concurrently. Moreover, while the lookup table is 
being accessed for the destination address, the source 
address from the previous look up is being analyzed so that 
a next source address may be calculated. Likewise, when the 
lookup table is being accessed for the source address, the 
previously obtained destination address is analyzed. The 
concurrent source and destination address lookups allow the 
search engines to search at twice the speed of search engines 
performing linear searches. Additionally, the lookup table is 
accessed during every clock cycle rather than sitting idle 
during some clock cycles. Performing concurrent searches is 
independent of the algorithm used. Thus, binary, linear, 
CAM, binary radix trees, hashing and other types of 
searches can be used. 

[0013] It will be recognized by those skilled in the art that 
the two described aspects of the invention can be performed 
independent of one another. That is, a switching device may 
have just one binary search engine (as opposed to multiple 
search engines in series) that performs concurrent source 
and destination address searches. Additionally, a switching 



device may have multiple search engines connected in series 
wherein the search engines do not perform concurrent 
searches. Alternatively, a switching device may include both 
aspects with search engines coupled in series and with the 
search engines performing concurrent searches. 

[0014] These advantages and other advantages and fea- 
tures of the inventions will become apparent from the 
following detailed description, which proceeds with refer- 
ence to the following drawings. 

BRIEF DESCRIPTION OF DRAWINGS 

[0015] FIG. 1A shows a known network containing mul- 
tiple segments connected through a switching device. 

[0016] FIG. IB shows a known lookup table stored in the 
switching device for identifying nodes connected to the 
segments. 

[0017] FIG. 2 shows a timing diagram of a search engine 
within the switching device of FIG. 1A. 

[0018] FIG. 3 shows a switching device according to one 
aspect of the invention with two binary search engines 
coupled in series. 

[0019] FIG. 4A shows a lookup table in sorted order 
according to another aspect of the invention. 

[0020] FIG. 4B shows the lookup table of FIG. 4A in 
binary tree format and showing four iterations of a binary 
search. 

[0021] FIG. 4C shows a memory configuration for a 
switching device having four search engines with each 
search engine performing one of the iterations of FIG. 4B. 

[0022] FIG. 4D shows search results that are passed 
between search engines from binary searches performed on 
the lookup table of FIG. 4A. 

[0023] FIG. 5 shows a flowchart of a method for forward- 
ing network frames in the switching device of FIG. 3. 

[0024] FIG. 6 shows a flowchart of a method for perform- 
ing different iterations of a binary search in separate binary 
search engines. 

[0025] FIG. 7 shows a detailed circuit diagram of a binary 
search engine according to another aspect of the invention 
where source and destination searches are performed con- 
currently, 

[0026] FIG. 8A shows a flowchart of a method for con- 
currently performing source and destination searches using 
the binary search engine of FIG. 7. 

[0027] FIG. 8B is a timing diagram for the binary search 
engine of FIG. 7. 

[0028] ' FIG. 9 shows another example of a lookup table 
using pointers to access the network addresses. 

DETAILED DESCRIPTION OF AN 
ILLUSTRATIVE EMBODIMENT 

[0029] FIG. 3 shows a switching device 50 having a port 
52 and port intercommunication logic 54. Port 52 includes 
a media interface 56, a primary memory 58, and a search 
engine 60. The search engine 60 includes temporary packet 
storage 62, packet analysis and key extraction logic 64, two 
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internal binary search engines 66, 68, a first stage memory 
70, and forwarding decision logic 72. There are multiple 
ports (not shown) in switching device 50. One or more ports 
are located on channel cards (not shown) mounted in a 
chassis. The number of ports and how the ports and port 
intercommunication logic 54 are mounted within a chassis 
are based on the particular application and are not important 
to the invention. 

[0030] Media interface 56 connects switching device 50 to 
a network (not shown) through a network cable 74. The 
network cable can take a variety of forms (e.g., fiber optic, 
twisted-pair, coaxial, etc.) depending on the type of network. 
A variety of network standards and protocols may be used, 
such as TCP/IP, IPX/SPX, FDDI, ATM, ETHERNET, 
GIGABIT ETHERNET, FAST ETHERNET, Token Ring, 
SONET, 100-base TX, etc. Other network protocols, stan- 
dards, and network cables now existing or later developed 
may be used with the invention, since these particular 
aspects are not important to the invention. Media interface 
56 is a communication link between search engine 60 and 
the network. Thus, media interface 56 allows search engine 
60 to send network frames in any desired format and media 
interface 56 reformats the frames for the particular network. 
Similarly, media interface 56 receives network frames from 
the network and formats the frames so they may be read by 
search engine 60. The media interface used is based on the 
particular application and is not important to understanding 
the invention. 

[0031] Temporary packet storage 62 within search engine 
60 holds the network frame temporarily while other com- 
ponents within the search engine determine where to for- 
ward the network frame to, as is further described below. In 
some circumstances, a decision may be made to not forward 
the packet at all. In such cases, the network frame is never 
passed to the port intercommunication logic. Depending on 
the size of the network frame, the temporary packet storage 
62 may hold several network frames, or, alternatively, only 
a portion of a network frame as it is forwarded to another 
port. 

[0032] Packet analysis and key extraction logic 64 extracts 
the source and destination addresses from the network frame 
and forwards the addresses to the first binary search engine 
66. The packet analysis and key extraction logic 64 may also 
pass additional information to the binary search engine 66, 
such as virtual LAN information that comes with the net- 
work frame or is derived based on the type of network frame. 
This additional information is appended to the source and 
destination addresses to form a key. One skilled in the art 
will recognize that the key may include any desired infor- 
mation, depending on the particular application. For 
example, the key may contain only the destination address, 
or only the source and destination addresses, or the source 
and destination addresses and additional information. For 
simplicity, the key is described genetically below as being 
the source and destination addresses. 

[0033] Binary search engines 66 and 68 are coupled to 
respective memories 58 and 70. The memories together store 
a lookup table that the binary search engines use for ana- 
lyzing network frames received from media interface 56. A 
portion of an example lookup table is shown in FIG. 4 and 
is further described below. A management processor (not 
shown) is coupled to search engines 66, 68 and maintains the 



lookup table in memories 58, 70. Specifically, the manage- 
ment processor may direct the search engines to delete old 
table entries, insert new table entries and generally maintain 
the table in sorted order so that the search engine 60 
performs searches efficiently and correctly. 

[0034] As is described in detail below, binary search 
engines 66, 68 perform a binary search on the lookup table. 
A binary search is a technique for searching a sorted table 
where successive midpoints of the table are determined and 
compared against a search term. Thus, to start the search the 
midpoint of the table is compared against the search term. 
By using the midpoint, the table is effectively divided into 
two parts and a determination is made as to which of the two 
parts the search term must reside in. Only the part that has 
the search term is analyzed and the other part is ignored. The 
midpoint of the pertinent part is then determined to divide 
the table into two more parts, and so on until the key is found 
or the search fails. Binary searches are understood in the art 
and are described in a text entitled Design and Analysis of 
Algorithms by Jeffrey D. Smith, 1989. Binary searches 
require a predetermined number of "iterations" to complete. 
The number of iterations depends on the size of the lookup 
table and follows the formula log 2 N=number of iterations, 
where N is the number of entries in the table. Thus, a 64K 
table requires 16 iterations to complete a binary search, 
while a 256 entry table requires 8 iterations to complete. 

[0035] One aspect of the invention is that the binary search 
engines divide the binary search of the lookup table by each 
performing some of the iterations of the overall search. For 
example, if the lookup table has 64K entries, the binary 
search engine 66 performs the first eight iterations of the 
search and binary search engine 68 performs the last eight 
iterations. Additionally, the first stage memory 70 does not 
contain the entire lookup table. Instead, it only contains 256 
entries needed for the first eight iterations of the search. The 
256 entries are determined and copied from the lookup table 
and include the successive midpoint combinations from the 
lookup table needed for the first eight iterations. Once the 
binary search engine 66 completes its eight iterations, the 
results are passed to binary search engine 68. Binary search 
engine 68 then uses the results as a starting point for its eight 
iterations of the binary search. A substantial amount of 
memory is saved by only using successive midpoint com- 
binations in the memory 70. Additionally, although two 
search engines are shown, any desired number of search 
engines may be used. For example, for a 16 iteration search, 
16 binary search engines can be used with each search 
engine performing one iteration. Alternatively, 4 binary 
search engines may be used with each binary search engine 
performing 4 iterations. Still further, the partitioning of 
iterations across search engines need not be equal. Also, the 
lookup table can be any desired length. Consequently, the 
binary search engines can perform any number of iterations 
depending on the particular application. 

[0036] After binary search engine 68 completes the binary 
search, the results are passed to forwarding decision logic 
72, which examines the results and applies a predetermined 
set of rules to determine whether the network frame should 
be forwarded and which port or ports it should be forwarded 
to. Forwarding decision logic may also examine the level of 
priority of the network frame. Higher priority frames are 
typically forwarded by the switching device 50 before lower 
priority frames. 
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[0037] When the forwarding decision logic 72 determines 
that a frame is to be forwarded to other ports in switching 
device 50, it passes the network frame to the port intercom- 
munication logic 54. Port intercommunication logic 54 
includes a switch fabric 76 and a switch fabric control 78. 
Switch fabric 76 can take a variety of forms. For example, 
the switch fabric can be a cross-bar switch, which is com- 
monly used in telecommunications switching. The cross-bar 
switch creates a path between a receiving port and a trans- 
mitting port so that the network frame may be passed 
therebetween. A wide variety of cross-bar switches may be 
used, such as cut-through switches, interim cut-through 
switches, and store-and-forward switches. Other types of 
switch fabrics may also be used. For example, switch fabric 
76 may also be a central memory using a bus arbitration 
device and a central bus. Using a shared-memory bus 
architecture, all ports access a central located memory pool. 
The ports can access the central memory through a common 
bus when an arbitration device grants access. Another pos- 
sible switch fabric that can be used is a parallel access 
shared -memory architecture. In a parallel access shared 
memory, all ports share a central memory location. How- 
ever, a bus arbitration scheme is not used. Instead, every port 
has a dedicated path into and out of the central memory 
fabric. Therefore, all ports can simultaneously access the 
centralized memory pool at any time. A wide variety of 
existing switch fabrics or later developed switch fabrics may 
also be used. The particular switch fabric and switch fabric 
control is not of importance to the invention. 

[0038] Switch fabric control 78 controls network frames 
as they are passed through the switch fabric. In the case 
where the switch fabric is a cross-bar switch, the switch 
fabric control is typically called a scheduler The scheduler 
establishes a connection within the cross-bar switch so that 
a search engine on one port can directly pass a network 
frame to a search engine on another port. In the case where 
the switch fabric is a memory, the switch fabric control tells 
a receive-side search engine where to store the frame io 
memory. After the frame is stored in memory, the switch 
fabric control signals a transmitting-side port that the net- 
work frame is ready to be transmitted and provides the 
address of the memory location where the frame in located. 
The switch fabric control may also provide priority infor- 
mation to the transmitting-side port. 

[0039] FIG. 5 shows a flow chart of a method used by the 
switching device 50 for forwarding network frames. In step 
80, the media interface 56 receives a network frame from the 
network and passes the frame to search engine 60. The 
network frame contains a destination address that indicates 
the ultimate destination for the network frame. In step 82, 
search engine 60 searches memories 58, 70 to determine 
whether the destination address from the network frame is 
located within the lookup table. If a network address 
matches the destination address, a port that the frame should 
be forward to is also obtained from the lookup table. 
Assuming that the destination address was properly found in 
the lookup table and the appropriate port was determined, 
the search engine passes the search results to switch fabric 
control 78 (step 84). In step 86, search engine 60 transfers 
the network frame through switch fabric 76. If the switch 
fabric is a cross-bar switch, the switch fabric control 78 
establishes the connection in the switch fabric and commu- 
nicates to search engine 60 when to send the network frame. 
If the switch fabric is a central memory with or without bus 



arbitration, the switch fabric control tells search engine 60 
where in the switch fabric to store the network frame. Search 
engine 60 then stores the network frame at the indicated 
location. The switch fabric control also informs the other 
ports of where the network frame is stored so that they may 
properly access it within the switch fabric. Regardless of the 
technique used with the switch fabric, the network frame is 
obtained from the switch fabric and transmitted on one or 
more ports (step 88). 

[0040] FIG. 4A shows an example lookup table 100 
having 16 data entries, 0000B through HUB. Each table 
entry includes two or more fields including, an address field 
102, illustrated as a network address, and a forwarding 
information field 104, illustrated as a port number. Although 
a network address field is shown, other addresses used in 
switching devices may be used. The forwarding information 
field also may contain information other than the port 
number. For example, the forwarding information field may 
analyze information related to the protocol format (e.g., 
MAC) that the address is in to determine which port to 
forward the network frame to. The network address field 102 
contains network addresses in sorted order (e.g., ascending 
or descending order). T^e^ortTnumber.field404ic^Sm^Jhe^ 
gort'associated-witM One or more 

of tfiese^da ita^fieTds may be deleted, or alternatively, addi- 
tional fields may be used. 

[0041] Additional fields may, for example, show a priority 
status associated with the network address. The network 
address field 102 identifies a destination to transmit the 
network frame to, such as a computer_qn a^network. Tne^^ 
gnetworkjiddre^ 

fields2(often called a pay load) in^a^tableZentry^ Address — ^ 
000^SiSpty-so tha^he-lookup-table-co^taipsran olidlZ? 
numbeS of ^entries . The miHjwinl^f'mTTable is at address 
1000B and the lookup table can be equally divided into two 
portions 106 and 108 with equal numbers of data entries. 
The first portion 106 can similarly be divided into two 
portions 110, 112 with a midpoint at address 0100B. The 
second portion 108 also can be divided into two portions 
114, 116 with a midpoint at address 1100B. By determining 
successive midpoints, a binary search tree can be generated. 

[0042] FIG. 4B shows a binary search tree 118 for the 
lookup table 100 of FIG. 4A. The search tree shows four 
iterations, which are the maximum number of iterations 
required for any search on a 16 entry table (i.e., 24=16). The 
tree is formed by determining the successive midpoints, 
called nodes, in the lookup table as described in relation to 
FIG. 4A. To perform a binary search, the search term is 
compared against the network address located at the mid- 
point address 1000B of the lookup table (i.e., 26). If the 
search term is less than 26, it is compared to 14. Alterna- 
tively, if it is greater than 26, it is compared to 38, and so on, 
until a match is found or the search fails. Each comparison 
that is performed is another iteration of the binary search. 
FIG. 4C shows that the different iterations of the binary 
search may be divided amongst 4. different binary search 
engines connected in series. A memory 120 for a first search 
engine only requires one data entry (one node) for the first 
iteration of the binary search, which represents the midpoint 
of the lookup table. A memory 122 for a second search 
engine requires two data entries for the second iteration, 
which represent the midpoints of portions 106 and 108 of the 
lookup table 100 (See FIG. 4A). A memory 124 for a third 
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search engine contains additional successive midpoints of 
the lookup table needed for the third iteration. And a 
memory 126 for a fourth search engine contains the remain- 
ing data entries of the lookup table. The fourth memory 126 
may also contain the entire lookup table 100. The memories 
120, 122, and 124 (called the precursor memories) may 
include only the network address field of the lookup table, 
without any payload fields. Alternatively, the precursor 
memories 120, 122, and 124 may include the payload 
information so that the lookup table is effectively distributed 
amongst many memories, even though it is a unitary data- 
base. In any event, the precursor memories contain succes- 
sive midpoint possibilities (nodes) for the different iterations 
of a binary search and the nodes are stored adjacent to one 
another. The precursor memories could contain additional 
table entries other than just the successive midpoint possi- 
bilities, if desired. 

[0043] FIG. 4D shows an example of data passed between 
the four search engines of FIG. 4C in searching for the 
network address 17 (the key) located at address 0101 in the 
lookup table (see FIG. 4A). As comparisons are made 
during iterations of the binary search, the final address of 
0101 is generated. That is, the first search engine generates 
the most significant bit, the last search engine generates the 
least significant bit, and the intermediate two bits are gen- 
erated by tbe intermediate search engines. When a compari- 
son is made, if the key is less than the network address 
obtained from the lookup table, a zero is passed to the next 
search engine. Alternatively, if the key is greater than the 
network address, a one is passed to the next search engine. 
Thus, with the key being 17, the first search engine compares 
26 (the first iteration) and 17. Since 17 is less than 26, a 0 
is passed to the next search engine in series as indicated at 
128. The 0 represents the most significant bit of the final 
address where the network address 17 resides in the lookup 
table. The next search engine then compares 17 against 14, 
since 38 is no longer a possibility. Since 17 is greater than 
14, the next bit in the address is a 1. Consequently, a 1 is 
appended to the most significant bit and a 01 is passed to the 
next search engine as indicated at 130. The next search 
engine then compares 17 and 20. Since 17 is less than 20, a 
0 is appended to the previous passed address resulting in a 
010 being passed to search engine 4, as indicated at 132. The 
last search engine must then compare 17 against 17 to obtain 
an address of 0101, which is the appropriate address in the 
lookup table (See FIG. 4A at 134). 

[0044] FIG. 6 shows a flowchart of a method for perform- 
ing binary searches as described in FIGS. 4A-4D. In step 
136, multiple search engines are coupled in series. Virtually 
any number of search engines may be used. For example, 
FIG. 3 shows two search engines, while FIG. 4C describes 
using four search engines. The invention can easily be 
extended to eight, sixteen, or any other number of search 
engines. Each search engine that performs iterations of the 
binary search has its own separate memory (step 138). The 
memories for the search engines store nodes (the network 
address field with or without the payload information) 
needed for that search engine to perform its respective 
iterations of the binary search (step 140). The nodes are the 
successive midpoint possibilities of the lookup table in 
binary tree format. In step 142, a first search engine in the 
series performs a predetermined number of iterations of the 
binary search. The results of the iterations are then passed to 
the next search engine in the series (step 144). In step 146, 



a check is performed to see whether the results are being 
passed to the last search engine in the series. If not, then 
steps 142 and 144 are repeated. If yes, then the final 
iterations of the binary search are performed and the results 
are passed to the forwarding decision logic (step 148). 

[0045] Although a wide variety of binary search engines 
can be used in the method and apparatus described in FIGS. 
1-6, a particular search engine is shown in relation to FIGS. 
7, 8A, and 8B. This search engine can also be used inde- 
pendently of the method and apparatus of FIGS. 1-6. In 
particular, the search engine can be used with searching 
techniques other than binary searches including linear 
searches, CAM, binary radix trees, hashing, etc. 

[0046] Prior search engines perform a search for the 
source address from the network frame and, only when the 
source address search is completed, perform a search for the 
destination address (See FIG. 2). The present invention, by 
contrast, performs both source and destination address 
searches concurrently. FIG. 7 shows a search engine 160 
coupled to a memory 162 containing a lookup table. The 
search engine 160 includes a source address comparator 164 
and a destination address comparator 166. The source 
address comparator 164 has one input coupled to a source 
address (SA) search key. This SA search key is the source 
address obtained from the network frame and passed to the 
binary search engine 160 by the packet analysis and key 
extraction logic 64 (see FIG. 3). A second input to com- 
parator 164 is coupled to a source address data register 168 
that contains a network address most recently obtained from 
memory 162 during a previous iteration of the source 
address search. The destination address comparator 166 also 
has two inputs with one coupled to a destination address 
(DA) search key obtained from the network frame and the 
other input coupled to a destination address data register 170 
containing a network address obtained from the memory 162 
during a previous iteration of the destination address search. 
The output of the comparators 164, 166 are coupled to 
respective next address calculation logic circuits 172, 174. 
The next address calculation logic circuits 172, 174 use the 
lookup table address from the previous search iteration and 
the results of the comparators 164, 166 to determine the next 
lookup table address to analyze. The logic circuits 172, 174 
make such a determination of the next address based on the 
nodes (i.e., successive midpoints) of the lookup table in the 
case of a binary search. In the case, of a linear search, the 
next address is simply the previous address plus one. Thus, 
the logic circuits 172, 174 perform different functions based 
on the application. The outputs of logic circuits 172, 174 are 
coupled to respective SAand DA address registers 176, 178. 
The outputs of the address registers 176, 178 are coupled to 
multiplexer 180 and are also fed back into next address 
calculation logic 172, 174, respectively. The multiplexer 180 
is coupled to a clock line 182 that switches a multiplexer 
output so that the contents of the SA address register 176 and 
the DA address register 178 are passed to memory 162 in an 
alternating fashion. The contents of the SA and DA address 
registers 176, 178 contain an address of a lookup table entry 
and when applied to memory 162 causes the memory to 
output data (i.e., a network address) on a data output 184. 
Clock 182 is also coupled to both data registers 168, 170 and 
is synchronized so that data is latched into source data 
register 168 when the multiplexer 180 passes the contents of 
source address register 176. Similarly, data register 170 
latches the data output 184 of memory when the address 



03/10/2004, EAST version: 1.4.1 



US 2003/0058864 Al 



6 



Mar. 27, 2003 



stored in DA address register 178 passes through the mul- 
tiplexer 180. Search termination logic 185 determines when 
a search is complete by analyzing outputs of the address 
registers 176, 178 and the comparators 164, 166. An output 
of the search termination logic 185 can be passed to the 
forwarding decision logic 72 (see FIG. 2) or to another 
search engine connected in series to signal when a search is 
complete and to pass the results of the search. 

[0047] FIG. 8B shows a timing diagram for the circuit of 
FIG. 7. A clock signal 186 is at twice the frequency of the 
clock on clock line 182. Three cycles 188, 190, and 192 of 
clock signal 186 are shown. During a first clock cycle 188, 
the SA address register 176 (FIG. 7) is already loaded with 
a lookup table address and the multiplexer 180 passes that 
address to the memory 162. Consequently, memory 162 
outputs data (a network address) on data output 184 corre- 
sponding to the address in the SA address register. At the end 
of cycle 188, the source data register 168 is loaded with the 
data from the memory 162. Also during cycle 188, logic 
circuit 174 is calculating the next lookup table address to 
analyze for the destination address search and the results are 
stored in DA address register 178. Clock cycle 188 repre- 
sents the start of a source address search iteration. 

[0048] During clock cycle 190, comparator 164 compares 
the SA search key and the SA data register 168. The next 
address calculation logic 172 determines the next lookup 
table address to analyze for the search, and the results are 
loaded into the SA address register 176, Also during clock 
cycle 190, the output of the DA address register is passed 
through multiplexer 180 and the corresponding data from 
memory 162 is latched into DA data register 170. Clock 190 
represents the start of a destination address search iteration. 
At the end of clock cycle 190, the source address search 
iteration is completed. Thus, clock cycles 188, 190 represent 
a complete source address search iteration in a search 
requiring multiple iterations. 

[0049] During clock cycle 192, another source address 
search iteration is started and the destination address search 
iteration is completed. The comparator 166 compares the 
DA search key and the output of the DA data register 170. 
The next address calculation logic 174 calculates the next 
lookup table address to examine and loads the calculated 
address into the DA address register. Thus, during the three 
clock cycles 188, 190, and 192 both a source and destination 
address search iterations are performed concurrently. Addi- 
tionally, the source and destination address searches are 
independent of each other. 

[0050] The hardware of FIG. 7 can be modified while 
maintaining the same timing diagram of FIG. 8B. For 
example, the multiplexer 180 can be removed and the 
address registers 176, 178 can have tristate outputs with the 
clock line 182 coupled to the address registers 176, 178. 
Thus, the clock switches the address registers 176, 178 in 
alternating fashion onto a common bus attached to memory 
162. Other modifications can also be made to the hardware, 
but the general timing diagram of FIG. 8B should remain 
the same. 

[0051] FIG. 8A is a flowchart summarizing the steps for 
concurrently performing source and destination address 
searches. During a first clock cycle, a network address is 
obtained from the lookup table as a first phase of a source 
address iteration (step 194). During a second clock cycle 



(step 196), the network address is compared to the SA search 
key, The next address calculation logic 172 reads the com- 
parator 164 and determines what lookup table address 
should be accessed next. Also during the second cycle, the 
lookup table is accessed for the next network address needed 
for a first phase of a destination address iteration and the 
source address iteration is completed. Finally, during a third 
cycle (step 198), the destination address iteration is com- 
pleted by resolving the next network address to analyze for 
the destination address search. Also, the first phase of the 
source address iteration is repeated. Although only three 
clock cycles are shown, the clock is continuous. 

[0052] A wide variety of searching techniques can be used 
with the concurrent search embodiment including binary, 
linear, CAM, binary radix trees, hashing and other types of 
searches. 

[0053] FIG. 9 shows another example of a lookup table 
200 that may be used according to the invention. The lookup 
table includes a list of pointers 202 and network address data 
204 stored in the memory within the switching device. The 
pointers point to the network address data 204. An example 
of actual memory addresses is shown in column 206 for 
illustrative purposes. Notably, neither the network addresses 
204 nor the list of pointers 202 are stored in sorted order. 
However, the contents of data pointed to by the list of 
pointers 202 are stored in sorted order For example, a 
pointer 208 points to address 107, which contains network 
address 159:22:0. A next pointer 210 in the list points to 
address 108 which contains the network address 159:22:99. 
The next pointer 212 points to address 106 which contains 
the network address 159:66:123. Although the list of point- 
ers 208, 210, and 212 are not in order (since the pointers are 
107, 108, 106, respectively) the data pointed to by the 
pointers is in order since addresses 159:22:0, 159:22:99 and 
159:66:123 are in ascending order. 

[0054] Thus, the lookup table according to the invention 
does not need to have data stored in contiguous addresses. 
Instead, network addresses may be in one memory location 
and forwarding information (such as port numbers) may be 
stored in other locations of memory. Pointers or some other 
technique for Unking associated data is then used. 

[0055] Having described and illustrated the principles of 
our invention with reference to preferred embodiments 
thereof, it will be apparent that these embodiments can be 
modified in arrangement and detail without departing from 
the principles of the invention. 

[0056] For example, although the port is shown as includ- 
ing separate components, such components can be formed in 
a single integrated circuit. Additionally, other circuit com- 
ponents, such as the port intercommunication logic 54 can 
also be included in the same integrated circuit as the port 52. 

[0057] Additionally, any of the components of the switch- 
ing device can be performed by hardware, software, or a 
combination thereof. The invention should not be limited to 
the particular technique (whether hardware or software) for 
carrying out the methods and apparatus described herein. 
For example, the search engine can be a microprocessor 
running software or an ASIC where the searches are per- 
formed in hardware. 

[0058] Further, although the search engines are generally 
shown performing equal iterations of binary searches, it is 
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not necessarily so. That is, the search engines can perform 
unequal iterations of searches. For example, in a 16 iteration 
search, a first search engine can perform 9 iterations and a 
second search engine can perform 7 iterations. Other varia- 
tions in the number of iterations can easily be applied and 
depends on the particular application and number of search 
engines used. 

[0059] Still further, although the search engine of FIG. 7 
shows two comparators and two next address calculation 
logic circuits, one of each may be used instead, with the 
clock controlling whether the SA address register or the DA 
address register is read by the next address calculation logic 
circuit. Additionally, the clock controls the single compara- 
tor so that only one of the DA data register and SA data 
register are compared at a time to one of the SA and DA 
search keys. 

[0060] Still yet further, although the lookup table is shown 
as having all memory locations filled, in practice this is not 
generally the case. Instead, the memory has only a portion 
filled with lookup table values and the remaining portion is 
padded with dummy values. For example, the table of FIG. 
4A may have only 9 lookup table entries with the remaining 
memory locations containing dummy values. In the case of 
an ascending table, the dummy values are a number suffi- 
ciently large enough that no network address can be con- 
fused with them. In the case of a descending table, the 
dummy values are a number sufficiently small enough that 
no network address can be confused with them. Addition- 
ally, the lookup table of FIG. 4A contains only 16 entries for 
illustration. In practice, lookup tables contain hundreds of 
thousands of entries. 

[0061] Still farther, although the lookup table is shown as 
containing table entries with multiple fields, the lookup table 
may be a list of pointers, and the pointers point to the 
network addresses and other fields. Additionally, the lookup 
table can be any type of data structure or array that store 
data. 

[0062] Yet further, the network packets can include any 
kind of data including video images, voice data during a 
phone call, a document, etc. 

[0063] Still further, when a port receives a network frame, 
it can analyze the destination address (at layer two and layer 
three) and it can also analyze layer 4 policy information if . 
desired. 

[0064] Still yet further, the present invention may be 
applied to LAN's, WAN's, the Internet, Intranets, telephone 
networks, or any other network. 

[0065] Additionally, although the lookup table is 
described as containing network addresses, the lookup table 
may include other keys. A key may include a network 
address exclusively or in combination with additional infor- 
mation. Alternatively, a key may include lookup information 
other than network addresses. Additionally, the switching 
device may receive network frames that include search keys. 
Alternatively, the search key may be derived from the 
network frame. For example, the search engine may use 
characteristics of the network frame, such as what port it was 
received on or what protocol format it is in, to derive the 
search key. The search key, which is either explicitly 
included in the network frame or derived therefrom, is 
compared to the lookup table which includes lookup keys. 



[0066] In view of the many possible embodiments to 
which the principles or invention may be applied, it should 
be recognized that the illustrated embodiment is only a 
preferred example of the invention and should not be taken 
as a limitation on the scope of the invention. Rather, the 
invention is defined by the following claims. We therefore 
claim as the invention all such embodiments that come 
within the scope of these claims. 

1. A network switching device comprising: 
a first search engine; 

a second search engine coupled to the first search engine; 

a first memory coupled to the first search engine; and 

a second memory coupled to the second search engine, 
characterized in that the first search engine receives a 
search key and performs a first search of the first 
memory for the search key, the first search engine 
passing results of the first search to the second search 
engine, the second search engine performing a second 
search of the second memory for the search key and 
appending results of the second search to the results of 
the first search. 

2. The network switching device of claim 2, wherein the 
first memory includes a portion of a lookup table needed for 
performing the first search. 

3. The network switching device of claim 2, wherein the 
second memory includes a portion of a lookup table needed 
for performing the second search. 

4. The network switching device of claim 2, wherein the 
second memory includes all entries of a lookup table. 

5. In a network switching device, a method for forwarding 
a discrete information unit comprising: 

generating a search key associated with the information 
unit; 

searching a first memory for the search key using a first 
search engine and rendering a first search result; 

passing the first search result to a second search engine; 

searching a second memory for the search key using the 
second search engine and rendering a second search 
result; 

appending the second search result to the first search 
result; 

retrieving forwarding information based on the appended 
search result; and 

forwarding the information unit or not based on the 
retrieved forwarding information. 

6. The method of claim 5, wherein the first memory 
includes a portion of a lookup table needed for performing 
the first search. 

7. The method of claim 5, wherein the second memory 
includes a portion of a lookup table needed for performing 
the second search. 

8. The method of claim 7, wherein the second memory 
includes all entries of a lookup table. 

9. A network switching device comprising: 

a first search engine; 

a second search engine coupled to the first search engine 
in series; 
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a first memory coupled to the first search engine; and 

a second memory coupled to the second search engine, 
characterized in that the first search engine receives a 
search key and performs a first search of the first 
memory for the search key, the first search engine 
passing results of the first search to the second search 
engine, the second search engine performing a second 
search of the second memory for the search key and 
appending results of the second search to the results of 
the first search. 

10. The network switching device of claim 9, wherein the 
first memory includes a portion of a lookup table needed for 
performing the first search. 

11. The network switching device of claim 9, wherein the 
second memory includes a portion of a lookup table needed 
for performing the second search. 

12. The network switching device of claim 9, wherein the 
second memory includes all entries of a lookup table. 

13. In a network switching device, a method for forward- 
ing a discrete information unit comprising; 

generating a search key associated with the information 
unit; 

searching a first memory for the search key using a first 
search engine and rendering a first search result; 



passing the first search result to a second search engine 
coupled to the first search engine in series; 

searching a second memory for the search key using the 
second search engine and rendering a second search 
result; 

appending the second search result to the first search 
result; 

retrieving forwarding information based on the appended 
search result; and 

forwarding the information unit or not based on the 
retrieved forwarding information. 

14. The method of claim 13, wherein the first memory 
includes a portion of a lookup table needed for performing 
the first search. 

15. The method of claim 13, wherein the second memory 
includes a portion of a lookup table needed for performing 
the second search. 

16. The method of claim 15, wherein the second memory 
includes all entries of a lookup table. 

* * * * * 
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